Efficient Clustering Earth Mover's Distance

نویسندگان

  • Jenny Wagner
  • Björn Ommer
چکیده

The two-class clustering problem is formulated as an integer convex optimisation problem which determines the maximum of the Earth Movers Distance (EMD) between two classes, constructing a bipartite graph with minimum flow and maximum inter-class EMD between two sets. Subsequently including the nearest neighbours of the start point in feature space and calculating the EMD for this labellings quickly converges to a robust optimum. A histogram of grey values with the number of bins b as the only parameter is used as feature, which makes run time complexity independent of the number of pixels. After convergence in O(b) steps, spatial correlations can be taken into account by total variational smoothing. Testing the algorithm on real world images from commonly used databases reveals that it is competitive to state-of-theart methods, while it deterministically yields hard assignments without requiring any a priori knowledge of the input data or similarity matrices to be calculated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Fuzzy, Probabilistic, and Possibilistic Partitions Using the Earth Mover's Distance

A number of noteworthy techniques have been put forth recently in different research fields for comparing clusterings. Herein, we introduce a new method for comparing soft (fuzzy, probabilistic and possibilistic) partitions based on the earth mover’s distance (EMD) and the ordered weighted average (OWA). The proposed method is a metric, depending on the ground distance, for all but possibilisti...

متن کامل

Earth Mover's Distance and Equivalent Metrics for Spaces with Semigroups

introduce a multi-scale metric on a space equipped with a diffusion semigroup. We prove, under some technical conditions, that the norm dual to the space of Lipschitz functions with respect to this metric is equivalent to two other norms, one of which is a weighted sum of the averages at each scale, and one of which is a weighted sum of the difference of averages across scales. The notion of 's...

متن کامل

Circular Earth Mover's Distance for the comparison of local features

Many computer vision algorithms make use of local features, and rely on a systematic comparison of these features. The chosen dissimilarity measure is of crucial importance for the overall performances of these algorithms and has to be both robust and computationally efficient. Some of the most popular local features (like SIFT [4] descriptors) are based on one-dimensional circular histograms. ...

متن کامل

Metric-Preserving Reduction of Earth Mover's Distance

We prove that the earth mover’s distance problem reduces to a problem with half the number of constraints regardless of the ground distance, and propose a further reduced formulation when the ground distance comes from a graph with a homogeneous neighborhood structure. We also propose to apply our formulation to the non-negative matrix factorization.

متن کامل

The Tangent Earth Mover's Distance

We present a new histogram distance, the Tangent Earth Mover’s Distance (TEMD). The TEMD is a generalization of the Earth Mover’s Distance (EMD) that is invariant to some global transformations. Thus, like the EMD it is robust to local deformations. Additionally, it is robuster to global transformations such as global translations and rotations of the whole image. The TEMD is formulated as a li...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010